The Parisi ultrametricity conjecture. 



Dmitry Panchenko 






December 6, 2011 



Abstract 



In this paper we prove that the support of a random measure on the unit ball 
of a separable Hilbert space that satisfies the Ghirlanda-Guerra identities must be 
ultrametric with probability one. This implies the Parisi ultrametricity conjecture in 
mean-field spin glass models, such as the Sherrington-Kirkpatrick and mixed p-spin 
models, for which Gibbs' measures are known to satisfy the Ghirlanda-Guerra identities 
in the thermodynamic limit.

1 Introduction and main result.

Let us consider a random probability measure $G$ on the unit ball of a separable Hilbert
space $H$. We will denote by $(\sigma^l)_{l \ge 1}$ an i.i.d. sequence from this measure, by $\langle \cdot \rangle$ the average
with respect to $G^{\otimes \infty}$ and by $\mathbb{E}$ the expectation with respect to the randomness of $G$. Let
$R_{l,l'} = \sigma^l \cdot \sigma^{l'}$ be the scalar product, or overlap, of $\sigma^l$ and $\sigma^{l'}$. A random measure $G$ is said to
satisfy the Ghirlanda-Guerra identities if for any $n \ge 2$, any bounded measurable function
$f$ of the overlaps $(R_{l,l'})_{l,l' \le n}$ and any bounded measurable function $\psi$ of one overlap,

\[
\mathbb{E}\langle f \psi(R_{1,n+1}) \rangle = \frac{1}{n} \mathbb{E}\langle f \rangle \, \mathbb{E}\langle \psi(R_{1,2}) \rangle + \frac{1}{n} \sum_{l=2}^{n} \mathbb{E}\langle f \psi(R_{1,l}) \rangle. \tag{1.1}
\]

Another way to express the Ghirlanda-Guerra identities is to say that under the measure $\mathbb{E} G^{\otimes \infty}$,
conditionally on $R^n = (R_{l,l'})_{l,l' \le n}$ the law of $R_{1,n+1}$ is given by the mixture

\[
\frac{1}{n} \mu + \frac{1}{n} \sum_{l=2}^{n} \delta_{R_{1,l}}, \tag{1.2}
\]

where $\mu$ is the law of $R_{1,2}$ under $\mathbb{E} G^{\otimes 2}$. We will prove the following.

Theorem 1. Under (1.1), the distribution of $(R_{l,l'})_{l,l' \ge 1}$ is ultrametric, i.e.

\[
\mathbb{E}\langle I(R_{1,2} \ge \min(R_{1,3}, R_{2,3})) \rangle = 1. \tag{1.3}
\]



It is known (Theorem 2 in [7]) that if a measure $G$ satisfies the Ghirlanda-Guerra identities
and if $q^*$ is the supremum of the support of $\mu$ then with probability one the support of $G$
belongs to the sphere of radius $\sqrt{q^*}$ in $H$. Therefore, (1.3) means that with probability one
over the choice of the random measure $G$, the distances on the Hilbert space $H$ between three
independent replicas $\sigma^1$, $\sigma^2$ and $\sigma^3$ sampled from $G$ must satisfy the ultrametric inequality,

\[
\|\sigma^1 - \sigma^2\| \le \max\bigl(\|\sigma^1 - \sigma^3\|, \|\sigma^2 - \sigma^3\|\bigr). \tag{1.4}
\]
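
As an illustrative aside that is not part of the original argument, the equivalence between the overlap form (1.3) of ultrametricity and its distance form can be checked on a finite overlap matrix. The sketch below assumes that all points lie on a sphere of squared radius `q_star`, so that the squared distance between two points with overlap $R$ is $2(q^* - R)$; the function names and the two sample matrices are hypothetical.

```python
import math
from itertools import permutations


def is_ultrametric_overlaps(R, tol=1e-12):
    """Check R[i][j] >= min(R[i][k], R[j][k]) for all distinct i, j, k."""
    n = len(R)
    return all(
        R[i][j] >= min(R[i][k], R[j][k]) - tol
        for i, j, k in permutations(range(n), 3)
    )


def distances_from_overlaps(R, q_star):
    """Distances between points of squared norm q_star with overlap matrix R."""
    n = len(R)
    return [
        [0.0 if i == j else math.sqrt(max(0.0, 2.0 * (q_star - R[i][j])))
         for j in range(n)]
        for i in range(n)
    ]


def is_ultrametric_distances(D, tol=1e-12):
    """Check D[i][j] <= max(D[i][k], D[j][k]) for all distinct i, j, k."""
    n = len(D)
    return all(
        D[i][j] <= max(D[i][k], D[j][k]) + tol
        for i, j, k in permutations(range(n), 3)
    )


q_star = 1.0
# Hypothetical example: replicas 1 and 2 are close, replica 3 is far from both.
tree_like = [[1.0, 0.5, 0.2], [0.5, 1.0, 0.2], [0.2, 0.2, 1.0]]
# Hypothetical example with R_{1,2} < min(R_{1,3}, R_{2,3}).
violating = [[1.0, 0.1, 0.3], [0.1, 1.0, 0.5], [0.3, 0.5, 1.0]]

print(is_ultrametric_overlaps(tree_like), is_ultrametric_overlaps(violating))
```

On the sphere the two checks agree because the distance is a decreasing function of the overlap, so a minimum of overlaps corresponds to a maximum of distances.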



Examples of random measures satisfying the Ghirlanda-Guerra identities arise in several
mean-field spin glass models, such as the Sherrington-Kirkpatrick model [13] and mixed $p$-spin
models, for which measures $G$ are defined as the asymptotic analogues of the Gibbs
measures in the thermodynamic limit by way of the Dovbysh-Sudakov representation [3].
Originally, the Ghirlanda-Guerra identities (1.1) were proved in [4] on average over the
inverse temperature parameter and later, in a closely related formulation, by introducing a
small perturbation term to the Hamiltonian of the model ([14]), but in some cases they can be
proved in a strong sense without perturbation ([8]).



The ultrametricity (1.3) of the overlap array $(R_{l,l'})$ was famously conjectured by G.
Parisi in [11], [12] as the cornerstone of a theory developed during the discovery of the
Parisi formula for the free energy in the Sherrington-Kirkpatrick model (see [6] for more
details). The Parisi formula was proved rigorously in a celebrated work of M. Talagrand
in [15] following the breakthrough invention of the replica symmetry breaking interpolation
scheme by F. Guerra in [5], which gave very strong indirect support to the entire Parisi
ansatz including the ultrametricity conjecture. More recently, several results providing some
direct mathematical support to the Parisi ultrametricity conjecture were proved under an
additional technical assumption that the overlaps take only finitely many values, i.e.
$R_{1,2} \in \{q_1, \ldots, q_k\}$ with probability one for some non-random values $(q_l)_{l \le k}$. The first such result
was proved by L.-P. Arguin and M. Aizenman in [2] as a consequence of the Aizenman-Contucci
stochastic stability property [1] of the Gibbs measures in the mixed $p$-spin models.
Inspired by [2], the author proved a similar result based on the Ghirlanda-Guerra identities
in [7] (see [9] for an elementary proof) and M. Talagrand gave a different proof in [16].
Unfortunately, in the Sherrington-Kirkpatrick and mixed $p$-spin models one expects the
distribution of the overlap to have a continuous component ([6]) so the results in [2], [7], and
[16] were not directly applicable to these models. In this paper we deduce ultrametricity (1.3)
without any assumptions on the distribution of the overlap by utilizing a new representation
of the Ghirlanda-Guerra identities proved in Theorem 2 below, which can be viewed as
a new invariance principle for random measures that satisfy (1.1). The idea behind this
representation was originally motivated by the stability property proved in [10] which unified
the Aizenman-Contucci stochastic stability and the Ghirlanda-Guerra identities; however,
the proof we give here is based only on the Ghirlanda-Guerra identities.

Acknowledgement. The author would like to thank Michel Talagrand for constant
encouragement of the efforts that led to this work.




2 Invariance principles.

In this section, we will first prove a new invariance property for random measures that satisfy
the Ghirlanda-Guerra identities in Theorem 2 and then deduce from it a modified version
of the invariance principle in Theorem 3 which will be used in the proof of Theorem 1 in
Section 3. Given $n \ge 1$, consider $n$ bounded measurable functions $f_1, \ldots, f_n : \mathbb{R} \to \mathbb{R}$ and
define

\[
F(\sigma, \sigma^1, \ldots, \sigma^n) = f_1(\sigma \cdot \sigma^1) + \ldots + f_n(\sigma \cdot \sigma^n). \tag{2.1}
\]

For $1 \le l \le n$ we define

\[
F_l(\sigma, \sigma^1, \ldots, \sigma^n) = F(\sigma, \sigma^1, \ldots, \sigma^n) - f_l(\sigma \cdot \sigma^l) + \mathbb{E}\langle f_l(R_{1,2}) \rangle \tag{2.2}
\]

and for $l \ge n+1$ we define

\[
F_l(\sigma, \sigma^1, \ldots, \sigma^n) = F(\sigma, \sigma^1, \ldots, \sigma^n). \tag{2.3}
\]

The definition (2.3) for $l \ge n+1$ will not be used in the statement but will appear in the
proof of the next result. Let us recall the notation $R^n = (R_{l,l'})_{l,l' \le n}$.

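Theorem 2. Suppose (1.1) holds and let $\Phi$ be a bounded measurable function of $R^n$. Then

\[
\mathbb{E}\langle \Phi \rangle = \mathbb{E} \Bigl\langle \frac{\Phi \exp \sum_{l=1}^{n} F_l(\sigma^l, \sigma^1, \ldots, \sigma^n)}{\langle \exp F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-}^{n}} \Bigr\rangle, \tag{2.4}
\]

where the average $\langle \cdot \rangle_{-}$ in the denominator is in $\sigma$ only for fixed $\sigma^1, \ldots, \sigma^n$ and the outside
average of the ratio is in $\sigma^1, \ldots, \sigma^n$.

When $n = 1$, it is understood that $\Phi$ is a constant. Notice that one can easily recover the
original Ghirlanda-Guerra identities from (2.4) by taking $f_1 = t\psi$ and $f_2 = \ldots = f_n = 0$ and
computing the derivative at $t = 0$.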

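Proof. Without loss of generality, let us assume that $\Phi$ takes values in $[0, 1]$ and suppose
that $|f_l| \le L$ for $1 \le l \le n$ for some large enough $L$. For $t \ge 0$ let

\[
\varphi(t) = \mathbb{E} \Bigl\langle \frac{\Phi \exp \sum_{l=1}^{n} t F_l(\sigma^l, \sigma^1, \ldots, \sigma^n)}{\langle \exp t F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-}^{n}} \Bigr\rangle. \tag{2.5}
\]

We will show that the Ghirlanda-Guerra identities (1.1) imply that this function is constant,
thus proving the statement of the theorem, $\varphi(0) = \varphi(1)$. If for $k \ge 1$ we denote

\[
D_{n+k} = \sum_{l=1}^{n+k-1} F_l(\sigma^l, \sigma^1, \ldots, \sigma^n) - (n+k-1) F_{n+k}(\sigma^{n+k}, \sigma^1, \ldots, \sigma^n)
\]

then one can easily compute by induction that (recall (2.3) and that we average in $\sigma$ only
in the denominator of (2.4))

\[
\varphi^{(k)}(t) = \mathbb{E} \Bigl\langle \frac{\Phi D_{n+1} \cdots D_{n+k} \exp \sum_{l=1}^{n+k} t F_l(\sigma^l, \sigma^1, \ldots, \sigma^n)}{\langle \exp t F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-}^{n+k}} \Bigr\rangle.
\]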

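First, let us notice that $\varphi^{(k)}(0) = 0$. Indeed, if we denote $\Phi' = \Phi D_{n+1} \cdots D_{n+k-1}$ then $\Phi'$ is
a function of the overlaps $(R_{l,l'})_{l,l' \le n+k-1}$ and

\[
\begin{aligned}
\varphi^{(k)}(0) &= \mathbb{E} \Bigl\langle \Phi' \Bigl( \sum_{l=1}^{n+k-1} F_l(\sigma^l, \sigma^1, \ldots, \sigma^n) - (n+k-1) F_{n+k}(\sigma^{n+k}, \sigma^1, \ldots, \sigma^n) \Bigr) \Bigr\rangle \\
&= \sum_{j=1}^{n} \mathbb{E} \Bigl\langle \Phi' \Bigl( \sum_{l \ne j,\, l=1}^{n+k-1} f_j(R_{j,l}) + \mathbb{E}\langle f_j(R_{1,2}) \rangle - (n+k-1) f_j(R_{j,n+k}) \Bigr) \Bigr\rangle = 0
\end{aligned} \tag{2.6}
\]

by the Ghirlanda-Guerra identities (1.1) applied to each term $j$. Now, since $|F_l| \le Ln$ and
$|D_{n+k}| \le 2L(n+k-1)n$ we get

\[
\begin{aligned}
|\varphi^{(k)}(t)| &\le \Bigl( \prod_{l=1}^{k} (n+l-1) \Bigr) (2Ln)^k \, \mathbb{E} \Bigl\langle \frac{\Phi \exp \sum_{l=1}^{n+k} t F_l(\sigma^l, \sigma^1, \ldots, \sigma^n)}{\langle \exp t F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-}^{n+k}} \Bigr\rangle \\
&= \Bigl( \prod_{l=1}^{k} (n+l-1) \Bigr) (2Ln)^k \varphi(t).
\end{aligned}
\]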

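Consider arbitrary $T > 0$. Again, using that $|F_l| \le Ln$ it is obvious that $\varphi(t) \le e^{2LTn^2}$ for
$0 \le t \le T$ and, therefore,

\[
\max_{0 \le t \le T} |\varphi^{(k)}(t)| \le e^{2LTn^2} \frac{(n+k-1)!}{(n-1)!} (2Ln)^k.
\]

By (2.6) and Taylor's expansion, for $0 \le t \le T$,

\[
|\varphi(t) - \varphi(0)| \le \max_{0 \le s \le t} |\varphi^{(k)}(s)| \frac{t^k}{k!} \le e^{2LTn^2} \frac{(n+k-1)!}{k! \, (n-1)!} (2Lnt)^k.
\]

Letting $k \to \infty$ we get that $\varphi(t) = \varphi(0)$ for $t < (2Ln)^{-1}$. Therefore, for any $t_0 < (2Ln)^{-1}$
we again have $\varphi^{(k)}(t_0) = 0$ for all $k \ge 1$ and by Taylor's expansion for $t_0 \le t \le T$,

\[
|\varphi(t) - \varphi(t_0)| \le \max_{t_0 \le s \le t} |\varphi^{(k)}(s)| \frac{(t - t_0)^k}{k!} \le e^{2LTn^2} \frac{(n+k-1)!}{k! \, (n-1)!} (2Ln(t - t_0))^k.
\]

Letting $k \to \infty$ proves that $\varphi(t) = \varphi(0)$ for $0 \le t < 2(2Ln)^{-1}$. We can continue in the same
fashion to prove this equality for all $0 \le t \le T$ and note that $T$ was arbitrary.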
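
The limit $k \to \infty$ in the Taylor argument rests on the fact that the binomial factor $\binom{n+k-1}{k}$ grows only polynomially in $k$, so multiplying it by $x^k$ with $x = 2Lnt < 1$ forces the remainder to zero. The following sketch, with a hypothetical helper name and the constant prefactor $e^{2LTn^2}$ omitted, evaluates this factor in log space to illustrate the behaviour on both sides of $x = 1$.

```python
import math


def taylor_remainder_factor(n, k, x):
    """Return binom(n + k - 1, k) * x**k, computed via logarithms.

    Here x stands for 2*L*n*t (an assumption of this sketch); the constant
    prefactor exp(2*L*T*n**2) from the bound is deliberately left out.
    """
    log_binom = math.lgamma(n + k) - math.lgamma(k + 1) - math.lgamma(n)
    return math.exp(log_binom + k * math.log(x))


for k in (10, 100, 1000):
    # Below the radius x = 1 the factor decays; above it the factor blows up.
    print(k, taylor_remainder_factor(3, k, 0.9), taylor_remainder_factor(3, k, 1.1))
```

This is why the argument extends the identity $\varphi(t) = \varphi(0)$ in steps of length just under $(2Ln)^{-1}$ rather than in one stroke.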

□ 

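Let us write down a corollary of Theorem 2 on which the proof of Theorem 1 will be based.
Consider a finite index set $\mathcal{A}$. Given $n \ge 1$ and configurations $\sigma^1, \ldots, \sigma^n$, let $(B_\alpha)_{\alpha \in \mathcal{A}}$ be a
partition of the Hilbert space $H$ such that for each $\alpha \in \mathcal{A}$ the indicator $I_{B_\alpha}(\sigma) = I(\sigma \in B_\alpha)$ is
a measurable function of $R^n$ and $(\sigma \cdot \sigma^l)_{l \le n}$ and let

\[
W_\alpha = W_\alpha(\sigma^1, \ldots, \sigma^n) = G(B_\alpha). \tag{2.7}
\]

Let us define a map $T$ by

\[
W = (W_\alpha)_{\alpha \in \mathcal{A}} \to T(W) = \Bigl( \frac{\langle I_{B_\alpha}(\sigma) \exp F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-}}{\langle \exp F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-}} \Bigr)_{\alpha \in \mathcal{A}}. \tag{2.8}
\]

The following holds.

Theorem 3. Under (1.1), for any bounded measurable function $\varphi : \mathbb{R}^{n^2} \times \mathbb{R}^{|\mathcal{A}|} \to \mathbb{R}$,

\[
\mathbb{E}\langle \varphi(R^n, W) \rangle = \mathbb{E} \Bigl\langle \frac{\varphi(R^n, T(W)) \exp \sum_{l=1}^{n} F_l(\sigma^l, \sigma^1, \ldots, \sigma^n)}{\langle \exp F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-}^{n}} \Bigr\rangle. \tag{2.9}
\]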

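Proof. For each $\alpha \in \mathcal{A}$ let us take an integer $n_\alpha \ge 0$ and let $m = n + \sum_{\alpha \in \mathcal{A}} n_\alpha$. Let $(S_\alpha)_{\alpha \in \mathcal{A}}$
be any partition of $\{n+1, \ldots, m\}$ such that $|S_\alpha| = n_\alpha$. Consider a continuous function
$\Phi : \mathbb{R}^{n^2} \to \mathbb{R}$ and let $\Phi' = \Phi(R^n) \prod_{\alpha \in \mathcal{A}} \varphi_\alpha$ where

\[
\varphi_\alpha = I(\sigma^l \in B_\alpha, \forall l \in S_\alpha). \tag{2.10}
\]

Let $f_l$ for $l \le n$ be as in (2.1) and $f_{n+1} = \ldots = f_m = 0$. Let us now apply Theorem 2 with
these choices of functions $\Phi'$ and $f_l$ (and $n = m$). First of all, integrating out the coordinates
$(\sigma^l)_{l > n}$, the left hand side of (2.4) can be written as

\[
\mathbb{E}\langle \Phi' \rangle = \mathbb{E} \Bigl\langle \Phi(R^n) \prod_{\alpha \in \mathcal{A}} \varphi_\alpha \Bigr\rangle = \mathbb{E} \Bigl\langle \Phi(R^n) \prod_{\alpha \in \mathcal{A}} W_\alpha^{n_\alpha}(\sigma^1, \ldots, \sigma^n) \Bigr\rangle \tag{2.11}
\]

where the $W_\alpha$ were defined in (2.7). Let us now compute the right hand side of (2.4). Since
$f_{n+1} = \ldots = f_m = 0$, the denominator will be $\langle \exp F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-}^{m}$ and

\[
\sum_{l=1}^{m} F_l(\sigma^l, \sigma^1, \ldots, \sigma^m) = \sum_{l=1}^{n} F_l(\sigma^l, \sigma^1, \ldots, \sigma^n) + \sum_{l=n+1}^{m} F(\sigma^l, \sigma^1, \ldots, \sigma^n). \tag{2.12}
\]

Since the denominator does not depend on $(\sigma^l)_{l > n}$, integrating in the coordinate $\sigma^l$ for $l \in S_\alpha$
will produce a factor

\[
\langle I_{B_\alpha} \exp F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-}.
\]

For each $\alpha \in \mathcal{A}$ we have $|S_\alpha| = n_\alpha$ such coordinates and, therefore, the right hand side of
(2.4) is equal to

\[
\mathbb{E} \Bigl\langle \frac{\Phi(R^n) \exp \sum_{l=1}^{n} F_l(\sigma^l, \sigma^1, \ldots, \sigma^n)}{\langle \exp F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-}^{n}} \prod_{\alpha \in \mathcal{A}} \Bigl( \frac{\langle I_{B_\alpha} \exp F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-}}{\langle \exp F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-}} \Bigr)^{n_\alpha} \Bigr\rangle. \tag{2.13}
\]

Comparing with (2.11), recalling (2.8) and approximating a continuous function on $[0, 1]^{|\mathcal{A}|}$
by polynomials we get (2.9) first for products $\Phi(R^n)\phi(W)$, then for continuous functions
$\varphi(R^n, W)$ and then for arbitrary bounded measurable functions.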

□

3 Proof of Theorem 1.



We mentioned in the introduction that $G$ is concentrated on the sphere of radius $\sqrt{q^*}$ so all
$\sigma$ below will be of length $\|\sigma\| = \sqrt{q^*}$. Consider a symmetric non-negative definite matrix
$A = (a_{l,l'})_{l,l' \le n}$ such that $a_{l,l} = q^*$ for $l \le n$. Given $\varepsilon > 0$, we will write $x \approx a$ to denote that
$x \in (a - \varepsilon, a + \varepsilon)$ and $R^n \approx A$ to denote that $R_{l,l'} \approx a_{l,l'}$ for all $l \ne l' \le n$ and, for simplicity
of notation, we will keep the dependence of $\approx$ on $\varepsilon$ implicit. Below, the matrix $A$ will be
used to describe a set of constraints such that the overlaps in $R^n$ can take values close to $A$,

\[
\mathbb{E}\langle I(R^n \approx A) \rangle > 0, \tag{3.1}
\]

for a given $\varepsilon > 0$. Let us introduce the notation

\[
a_n^* = \max(a_{1,n}, \ldots, a_{n-1,n}). \tag{3.2}
\]

The main step in the proof of Theorem 1 is the following result which will be based on the
invariance principle of Theorem 3.

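Theorem 4. Under (1.1), given $\varepsilon > 0$, if the matrix $A$ satisfies (3.1) and $a_n^* + \varepsilon < q^*$ then

\[
\mathbb{E}\langle I(R^n \approx A, R_{1,n+1} \approx a_{1,n}, \ldots, R_{n-1,n+1} \approx a_{n-1,n}, R_{n,n+1} < a_n^* + \varepsilon) \rangle > 0. \tag{3.3}
\]

Theorem 4 will be used in the following way. Suppose that the matrix $A$ is such that $a_n^* < q^*$
and $A$ is in the support of the distribution of $R^n$ under $\mathbb{E} G^{\otimes \infty}$ which means that (3.1) holds
for all $\varepsilon > 0$. Since $a_n^* + \varepsilon < q^*$ for small $\varepsilon > 0$, (3.3) holds for all small $\varepsilon > 0$. Therefore, the
support of the distribution of $R^{n+1}$ under $\mathbb{E} G^{\otimes \infty}$ intersects the event in (3.3) for every small $\varepsilon > 0$
and since the support is compact it contains a point in the set

\[
\bigl\{ A' : a'_{l,l'} = a_{l,l'} \text{ for } l, l' \le n, \ a'_{l,n+1} = a_{l,n} \text{ for } l \le n-1, \ a'_{n,n+1} \le a_n^* \bigr\}. \tag{3.4}
\]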

Proof of Theorem 4. We will prove (3.3) by contradiction, so suppose that the left hand
side is equal to zero. We will apply Theorem 3 with $\mathcal{A} = \{1, 2\}$ and the partition

\[
B_1 = \{\sigma : \sigma \cdot \sigma^n \ge a_n^* + \varepsilon\}, \quad B_2 = B_1^c.
\]

Since we assume that $a_n^* + \varepsilon < q^*$, the set $B_1$ contains a small neighborhood of $\sigma^n$ and on
the event $\{R^n \approx A\}$ its complement $B_2 = B_1^c$ contains small neighborhoods of $\sigma^1, \ldots, \sigma^{n-1}$
since $R_{l,n} < a_{l,n} + \varepsilon \le a_n^* + \varepsilon$ and, thus, on this event for $\sigma^1, \ldots, \sigma^n$ in the support of $G$ the
weights $W_1 = G(B_1)$, $W_2 = G(B_2) = 1 - W_1$ are strictly positive. Then, (3.1) implies that
we can find $0 < p < p' < 1$ and small $\delta > 0$ such that

\[
\mathbb{E}\langle I(R^n \approx A, W_1 \in (p, p')) \rangle \ge \delta. \tag{3.5}
\]

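Let us apply Theorem 3 and (2.9) with the above partition, the choice of

\[
\varphi(R^n, W) = I(R^n \approx A, W_1 \in (p, p')) \tag{3.6}
\]

and the choices of functions $f_1 = \ldots = f_{n-1} = 0$ and $f_n(x) = t I(x \ge a_n^* + \varepsilon)$ for $t \in \mathbb{R}$. The
terms that appear on the right hand side of (2.9) will become

\[
\begin{aligned}
\sum_{l=1}^{n} F_l(\sigma^l, \sigma^1, \ldots, \sigma^n) &= \sum_{l=1}^{n-1} t I(R_{l,n} \ge a_n^* + \varepsilon) + t \, \mathbb{E}\langle I(R_{1,2} \ge a_n^* + \varepsilon) \rangle \\
&= t \, \mathbb{E}\langle I(R_{1,2} \ge a_n^* + \varepsilon) \rangle =: t\gamma
\end{aligned}
\]

since, again, on the event $\{R^n \approx A\}$ the overlaps $R_{l,n} < a_{l,n} + \varepsilon \le a_n^* + \varepsilon$ for $l \le n-1$ and

\[
\langle \exp F(\sigma, \sigma^1, \ldots, \sigma^n) \rangle_{-} = \langle \exp t I(\sigma \cdot \sigma^n \ge a_n^* + \varepsilon) \rangle_{-} = \Delta_t(W)
\]

where

\[
\Delta_t(W) = G(B_1) e^t + G(B_2) = W_1 e^t + 1 - W_1. \tag{3.7}
\]

If $W = (W_1, W_2)$, the map $T_t(W)$ corresponding to (2.8) can now be written as

\[
T_t(W) = \Bigl( \frac{W_1 e^t}{\Delta_t(W)}, \frac{W_2}{\Delta_t(W)} \Bigr). \tag{3.8}
\]

Since $\Delta_t(W) \ge 1$ for $t \ge 0$, in this case equation (2.9) together with (3.5) implies

\[
\delta \le \mathbb{E} \Bigl\langle \frac{I(R^n \approx A, (T_t(W))_1 \in (p, p')) \, e^{t\gamma}}{\Delta_t(W)^n} \Bigr\rangle \le \mathbb{E}\langle I(R^n \approx A, (T_t(W))_1 \in (p, p')) \, e^{t\gamma} \rangle. \tag{3.9}
\]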

In the average $\langle \cdot \rangle$ on the right hand side let us fix $\sigma^1, \ldots, \sigma^{n-1}$ and consider the average with
respect to $\sigma^n$ first. Clearly, on the event $\{R^n \approx A\}$ such average will be taken over the set

\[
\Omega(\sigma^1, \ldots, \sigma^{n-1}) = \{\sigma : \sigma \cdot \sigma^l \approx a_{l,n} \text{ for } l \le n-1\}. \tag{3.10}
\]

Let us look at the diameter of this set on the support of $G$. Suppose that with positive
probability over the choice of the measure $G$ and replicas $\sigma^1, \ldots, \sigma^{n-1}$ from $G$ satisfying
the constraints in $A$ (i.e. $R_{l,l'} \approx a_{l,l'}$ for $l, l' \le n-1$) we can find two points $\sigma', \sigma''$ in the
support of $G$ that belong to the set $\Omega(\sigma^1, \ldots, \sigma^{n-1})$ and such that $\sigma' \cdot \sigma'' < a_n^* + \varepsilon$. This
would then imply (3.3) since for $(\sigma^n, \sigma^{n+1})$ in a small neighborhood of $(\sigma', \sigma'')$ the vector
$(\sigma^1, \ldots, \sigma^n, \sigma^{n+1})$ would belong to the event

\[
\{R^n \approx A, R_{1,n+1} \approx a_{1,n}, \ldots, R_{n-1,n+1} \approx a_{n-1,n}, R_{n,n+1} < a_n^* + \varepsilon\}
\]

on the left hand side of (3.3). Since we assume that the left hand side of (3.3) is equal to
zero, we must have that with probability one over the choice of the measure $G$ and replicas
$\sigma^1, \ldots, \sigma^{n-1}$ satisfying the constraints in $A$ any two points $\sigma', \sigma''$ in the support of $G$ that
belong to the set $\Omega(\sigma^1, \ldots, \sigma^{n-1})$ satisfy $\sigma' \cdot \sigma'' \ge a_n^* + \varepsilon$. Now, let us also recall that in (3.9)
we are averaging over $\sigma^n$ that satisfy the condition $(T_t(W))_1 \in (p, p')$. If we fix any such $\sigma'$
in the support of $G$ that satisfies this condition and belongs to the set (3.10), then the Gibbs
average in $\sigma^n$ will be taken over its neighborhood $B_1 = B_1(\sigma') = \{\sigma'' : \sigma' \cdot \sigma'' \ge a_n^* + \varepsilon\}$ of
measure $W_1 = W_1(\sigma') = G(B_1(\sigma'))$ that satisfies $(T_t(W))_1 \in (p, p')$. One can easily check
that the map in (3.8) satisfies $T_t^{-1} = T_{-t}$ and using this for $(T_t(W))_1 \in (p, p')$ implies that

\[
W_1(\sigma') \in \Bigl\{ \frac{q e^{-t}}{q e^{-t} + 1 - q} : q \in (p, p') \Bigr\}
\]

and, thus, $W_1(\sigma') \le (1 - p')^{-1} e^{-t}$. This means that the average on the right hand side of
(3.9) over $\sigma^n$ for fixed $\sigma^1, \ldots, \sigma^{n-1}$ is bounded by $(1 - p')^{-1} e^{-t} e^{t\gamma}$ and, therefore, for $t \ge 0$

\[
0 < \delta \le \mathbb{E}\langle I(R^n \approx A, (T_t(W))_1 \in (p, p')) \, e^{t\gamma} \rangle \le (1 - p')^{-1} e^{-t(1 - \gamma)}. \tag{3.11}
\]

Since $A$ satisfies (3.1), $1 - \gamma = \mathbb{E}\langle I(R_{1,2} < a_n^* + \varepsilon) \rangle > 0$ and letting $t \to +\infty$ in (3.11) we
arrive at contradiction.

□ 

Proof of Theorem 1. The proof is again by contradiction. Suppose that ultrametricity is
violated in which case there exist $a < b \le c < q^*$ such that the matrix

\[
\begin{pmatrix} q^* & a & b \\ a & q^* & c \\ b & c & q^* \end{pmatrix} \tag{3.12}
\]

is in the support of the distribution of $R^3$ under $\mathbb{E} G^{\otimes \infty}$ so it satisfies (3.1) for every $\varepsilon > 0$.
In this case Theorem 4 implies the following. Given any $n_1, n_2, n_3 \ge 1$ and $n = n_1 + n_2 + n_3$
we can find a matrix $A$ in the support of the distribution of $R^n$ under $\mathbb{E} G^{\otimes \infty}$ such that for
some partition of indices $\{1, \ldots, n\} = I_1 \cup I_2 \cup I_3$ with $|I_j| = n_j$ we have $j \in I_j$ for $j \le 3$ and

(a) $a_{l,l'} \le c$ for all $l \ne l' \le n$,

(b) $a_{l,l'} = a$ if $l \in I_1$, $l' \in I_2$, $a_{l,l'} = b$ if $l \in I_1$, $l' \in I_3$ and $a_{l,l'} = c$ if $l \in I_2$, $l' \in I_3$.

This can be proved by induction on $n_1, n_2, n_3$. First of all, by the choice of the matrix (3.12)
this holds for $n_1 = n_2 = n_3 = 1$. Assuming the claim holds for some $n_1$, $n_2$ and $n_3$ with
the matrix $A$, let us show how one can increase any of the $n_j$'s by one. For example, let us
assume for simplicity of notation that $n \in I_3$ and show that the claim holds with $n_3 + 1$.
Since $a_n^* \le c < q^*$, we can use the comment below Theorem 4 to find a matrix $A'$ in the
support of the distribution of $R^{n+1}$ under $\mathbb{E} G^{\otimes \infty}$ that belongs to the set (3.4). Hence, $a'_{l,l'} \le c$
for all $l \ne l' \le n+1$ and $a'_{l,n+1} = a_{l,n}$ for $l \le n-1$ so, in particular, $a'_{l,n+1} = b$ if $l \in I_1$
and $a'_{l,n+1} = c$ if $l \in I_2$ which means that $A'$ satisfies the conditions (a), (b) with $I_3$ replaced
by $I_3 \cup \{n+1\}$. In a similar fashion, one can increase the cardinality of $I_1$ and $I_2$ which
completes the induction. Now, let $n_1 = n_2 = n_3 = m$, find the matrix $A$ as above and find
$\sigma^1, \ldots, \sigma^n$ on the sphere of radius $\sqrt{q^*}$ such that $R_{l,l'} = a_{l,l'}$ for all $l, l' \le n$. Let $\bar{\sigma}^j$ be the
barycenter of the set $\{\sigma^l : l \in I_j\}$. Condition (a) implies that

\[
\|\bar{\sigma}^j\|^2 = \frac{1}{m^2} \sum_{l \in I_j} \|\sigma^l\|^2 + \frac{1}{m^2} \sum_{l \ne l' \in I_j} R_{l,l'} \le \frac{m q^* + m(m-1)c}{m^2}
\]

and condition (b) implies that $\bar{\sigma}^1 \cdot \bar{\sigma}^2 = a$, $\bar{\sigma}^1 \cdot \bar{\sigma}^3 = b$ and $\bar{\sigma}^2 \cdot \bar{\sigma}^3 = c$. Therefore,

\[
\|\bar{\sigma}^2 - \bar{\sigma}^3\|^2 = \|\bar{\sigma}^2\|^2 + \|\bar{\sigma}^3\|^2 - 2 \bar{\sigma}^2 \cdot \bar{\sigma}^3 \le \frac{2(q^* - c)}{m}
\]

and $0 < b - a = \bar{\sigma}^1 \cdot \bar{\sigma}^3 - \bar{\sigma}^1 \cdot \bar{\sigma}^2 \le K m^{-1/2}$. Letting $m \to \infty$, we arrive at contradiction.
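
As a small numerical illustration of the final step, and not a part of the proof, one can evaluate the barycenter computation for concrete values. The sketch below assumes the extreme case in which every overlap inside a block equals `within` (at most $c$) and every overlap across the two blocks equals `between`; the function names are hypothetical, and the constant $K$ is taken to be $\sqrt{2 q^* (q^* - c)}$, which is what the Cauchy-Schwarz inequality gives.

```python
import math


def barycenter_sq_distance(q_star, within, between, m):
    """Squared distance between the barycenters of two blocks of m points.

    Points have squared norm q_star, overlap `within` inside each block
    and overlap `between` across the two blocks.
    """
    sq_norm = (m * q_star + m * (m - 1) * within) / m**2
    return 2.0 * sq_norm - 2.0 * between


def smallest_contradicting_m(a, b, c, q_star):
    """Smallest m with sqrt(2 * q_star * (q_star - c) / m) < b - a."""
    if not (a < b <= c < q_star):
        raise ValueError("expected a < b <= c < q_star")
    m = 1
    while math.sqrt(2.0 * q_star * (q_star - c) / m) >= b - a:
        m += 1
    return m


# With within = between = c the bound 2 (q_star - c) / m is attained.
print(barycenter_sq_distance(1.0, 0.5, 0.5, 10), 2.0 * (1.0 - 0.5) / 10)
print(smallest_contradicting_m(0.0, 0.3, 0.5, 1.0))
```

For any fixed gap $b - a > 0$ the loop terminates, which mirrors the statement that letting $m \to \infty$ yields a contradiction.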